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Remoie fftsfrumenf Diiiffnosis 
on §he fiffernef 

Nicholas HJIIL Caldwell, Bernard C Breton, and David M* Holburn, University of Cambridge 



tms expert systw^i uses generic toois such as 
Web browsers to ajd service engineers in 
remomi diagnosing the faults in leo scanning 
electron microscopes. The system lets engineers \ 
acquire information dmectlyfrom the sem, 
diagnose inaccurate behavior^ and remedy 
inappr0prl4te instrumeifr settings. 



Stable avenue for idea exchange, researches 
have begun to explore more ways to use it in 
knowledge acquisition. In one such effort, 
researchers in Cambridge University's Engi- 
neering Department are collaborating with 
LEO Electron Microscopy Ltd., to explore 
how knowledge-based systems can improve 
operator interaction with a scanning electron 
microscope, or SEM. A product of that work 
is First AID (Automated Instrument Diag- 
nostics), a fault diagnosis expert system that 
is based on an information client architec- 
ture. The sidebar '*How information clients 
differ from knowledge servers" ejcplains 
more about this type of architecture and how 
First AID differs front existing systems; Like 
other information-client-based systems, First 
AID can access one or more Internet sites to 
obtain pertinent information or data, whid) it 
then uses to assist inference. This contrasts 
to the knowledge-server architectural ap- 
proach,^ in which the system resides on the 
network and reasons on behalf of multiple 
remote clients that provide the interface for 
remote users. 

hi this articlei \ye describe an application 
of First AID for the LEO 400 scries, PG-based 
SEMs, which run on Microsoft Windows. 
First AID lets microscope service engineers 
remotely diagnose. LEO's SEMs, thereby 
avoiding costly travel to customer sites. As 
far as we know, this is the first time the Inter- 
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net has been used for diagnosis in this fashion. 
A recent assessment by service engineers 
indicates tfiat the Hrst AID system can be of 
major benefit in tiae diagnostic task for aver- 
age and expert engineers alike. 

Why remote diagnosis? 

A scanning electron microscope is a so- 
phisticated scientific instrument that, in ad- 
vanced versions, is fully computer-controlled. 
Three decades of commercial manufacturing 
has led to a steady evolution of SEM sophis- 
tication. This, combined with increased man- 
ufacturer competition and a shrinking market, 
have created a critical need for maintenance 
support 

However, SEM fault diagnosis suffers 
firom twomain disadvantages. First, there arc 
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relativiely few experts. The primary reason is 
that a ser\ice engineer is expected to repair . 
any SEM-^finom the oldest surviving instru- 
ment, possibly 30 years bid, to the most mod- j 
em noachines. Acquiring and retaining exper- 
tise for this broad range of SEMs is difficult 
because there are only so many instruments :» 
of a particular model. ; 

Second,: SEM. manufacturers are geo- / 

graphically distant For example, SEM tech- '* 
nical support for LEO is distributed between 

the dual conapany headquarters at Cambridge • ] 

(UK) and Ob^rkochen (Germany) and the , ^ 

local service agencies in various countries. . :•, 

The local service agencies provide the first 5 

level of customer support, referring more dif^ [\ 
ficult cases to headquarters* Thus a telephone 

diagnosis and subsequent treatment can 'I 
require a si re .vi si t, with .<*pare parts being 

ordered fi-om the UK in some cases, Misdi- ;| 
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agnosis, which involves taking wrong or 
unneeded components to a site > costs the 
company time and nioney and the customer 
unnecessarily prolonged downtime. 

Telephone diagnosis is also unsatisfac- 
tory because the SEM is a visual instru- 
ment, with many symptoms appearing as 
imaging defects. To communicate an imag- 
ing defect, the customer must describe the 
syn^Jtom in great detail verbally. The ambi- 
guity of natural language becomes even 
more of an obstacle when one or both par- 
ties must use a foreign language. Tradi- 
tional diagnostic methods are also unsuit- 
able for correcting another possible cause 
of malfunction — an incorrect instrument 
parameter setting. In this situation, the 
SEM appears to be malfunctioning because 
some sequence of actions has caused it to 
reach an inappropriate combination of 
instrument parameters (we describe an 
example later). In the traditional mode of 
diagnosis, the customer must produce an 
exhaustive list of instrument-setting para- 
meters, which is time-consuming. 

System requirements ami 
j overview 

I Given these remote-diagnosis require- 
ments, any SEM fault-diagnosis system must 
provide die service engineer with the knowl- 
edge of the best domain experts as well as 
accurate image aitd tnstiiunent parameter set- 
tings from the SEM itself. 

To meet the first lequirement, we needed 
an expert system. To meet the second require- 
ment, we needed an expert system that could 
acquire information directly from the SEM, 
remotely conduct diagnoses, and remotely 
remedy inappropriate instrument settings. 

At the outset, we knew that a tault-free 
instrum^t was a prerequisite for optimizing 
SEM operation and that we would need the 
same expertise for both SEM operation aitd 
fault diagnosis. We also needed a mote flexi- 
ble environment than, say, an expert shell, 
because the remote-cantrol software and future 
systems would be embedded in the SEM itself. 
Fmally, the research costs for Cambridge Uni- 
versity and the final deployment costs to LEO 

had to be as low as possible. These constraints 

(among others) led us to develop First AID 
fixnn scratch using Logic Programming Asso- 
ciates' Win-Prolog {http://www.lp(L co. uk). 

We conducted the initial knowledge 
acquisition for First AID over four weeks. 



During that time, we drew knowledge pri- 
marily from service manuals and archived 
case histories. We then developed proto- 
heuristics and submitted them to domain 
experts to evaluate and validate. Wc later 



expanded this knowledge base through infor- 
mal interviews with the domain experts and 
by mining relevant documents. 

The next step was to design a rule-based 
representation that simulates the engineer's 



" J : *M- RrUrsson, "Expert Systpm? as Knbwtedgje jScrwra,*'/£/i/?£ij)eit.\fiiL (4: Sk).:3KMay/Juiio ; 
I9%.pp.:14^19;. ; . 

2. /.aCoheB.L..Bci3gmaB,anaPJ*SbopbclI/AHigh5pe^ 

. ipgfroinGaUDchwiftibeKfickTfe^^ 

3! F. HcoiKniKni'c^ al. •*RcouHe Eftvcadiopping whh the Wortd W«lp Web,*** f^rocJL Astrqitom^ 

.'■ cal Ddta AriaivsifS^Jh^Te and Systems Vi Astro nomicaJ Sw. pf the' Pacittc, (99(5, pjp. • 

.' :384-39l. 

4. }. Baruch and M. CbK. "RcmoterCootrol and Robots: Aa tntcmc^ SoluUoh " Ctwipw/iVi^ & 
■ . C^nva/ii>*^.iv Vol. 7, No. 1,1996, pi). 39-45. * 

5. C Wong ct al, ''Dcvelopincm and Deploymci>tr of a Rule-Based. Expert iSystcm for 
: Autonomcmji Satellite Monitorins:/* Proc. Astwnontkal Data Analysis Software and J^- 

fcmi V. AiOiuiwmicalSoc. of Ibc Padfic^ 

6. ■ A. If^augier ci al.* "Remote Ipstnimcnl TfelemainUaiancc*' Cotnptiter Methods <fi Fn>grams 



How InfomtiKwii^ 

• .Ihe.iiifiWTnaiioiv-clicnt dirchitocture is a niror of ihe-raoro femiUar -^owledgie-Row . ' 
architecture. In knGwlcdg6-servcr.,arch)tecttir^, the syKtbmtesides^K^n tSte network and nniiti^ \ 
jrfc plients access- itTteiotely; Tto infcrerioes/are derived at die server site, while ibpirt suhtoiis- 
fti(m:andpu^(xit di.splay is handled on the cltfeftt'slte (Using lava applets jn Hehrik Bdk^ri's '1 

;'iinjrfen>entatj6n*). THu arohlteciuc^ is appTX>^itriate jTor puMic expert iysteAis;' which coulif be , ' 

.ufiedfSay.tboff-lo^ belpde^bymj^k^ . - ; 

. . Expert tiyjitem Khe) U h^e recofltly hecome oomrac^cia^y avai labld that adopt the kiioi^lf ' . . 

;<^dgc^>le^yer'approadl. Wren (Fxsys Web RuDiinw E*i&inD))t;ls kmnvlodgc bases (rnifJ&tscdled ' 
using ExBys devdk^jcpfiiit toois) be transferred onto Web secviere so Hiat uvers can «cett' ibc 
leiwltibg expert systcxns via the Wbb. Wrtn" {hUp.'/Jwyfw.^XsysinSa: cdm/WrenMrm.htmiy is 
■purely HTWL-hased hriplcmciitatioii thai use* OGl (Common Gateway Intcaface) fprtns to a^t 
the user qUe«kHi»,'i^catooOdcIiisi<n^^ ' • ■ ' ' 

' ; An informatioo-dieni-based system, in conirast, accesses one or mmc Intdnet ^facslo obtain 
pibrtinqoi informodoD or daia,.w}ucbit then uses to assist tof«rence. Ihcfw systeins,h9ve many , 
po»^^ applicatRmti in remote diagoosis, Kusing. muniiorm^, and control. The astH»kiiDicat 
c^mmuolty has implomcntod several systems fbr long-range icinoto pcmtrdl of Eaith-ba&ed tde- 
s<;opes, some iisiogde(ficat^ netwodcff,^ and oibet&:using tb& Wcb.^^ T>c aim of these systems , 
is: (o alleviaie the pFphlcms caused by the ^eogniphicat separation hetwi^. modem i^setrafo- 

.. ries and potential rcsoarchers. Some of these teleseope systems prevent inzuTtizoiniat&iv 6axi^- 
ous instrument rBan(»vera by vaUdat^Dg the icqpc^ - 
the instrunfeot. Althongh these synte^ would beo/tfii from a )cncwled^B^^ft«d ^jp^'t^b; 
expert syitems have not been connected to (lie rcfrKWe-Gomrtoi fawliiics. An^extKn syttteti) • • "] 
been developed ahddcployiSd to roomtw 

' mctit? tn uonttast id InrstAiP^.this ^steni uses a praprictaiy tolemctiy^feccptipA module *, J 
rotber ihan W^bteed techniques for data ocquisitiun. A theowtical aruhiiecttse' wa& -^Iso pd>- ■ - j 
pos^ for multiple instrament diagnosis using *1elemaintettance sorvcrs'" to handle sirtq^le:&ults - ■ 
■ wi& autoMi^c cscalatiqn to remote inaintcnaDco centers 'whcrc neoral nctwurk or knowledge^ * 
'.batH^ systenas naight be used to resolve th? pipbtem.^ Hrst AID is ciinenUy a leacUve system to 
. 'dia^se and remedy faults njice they bai*e ppcurrcd. However,, the LHO control softwoie could 
' he enh^EOced to tran^rnit regular logs tn.semce headquiirtOFK for pit>acti^ faolfdeti^cticHL 
Knoftvledge^bftsedS^'inr^ ; 



mr/JUN! 1998 



71 



tion tests on the working system. We de- j 
scribe Hrst AID's design and implementa- ' 
lion in greater detail elsewhere.^ I 

i 
I 

Remote-control system 

i 




Rgttre I . A sample ndo in the formarBm used to simulate the engineer's reasoning from a set of inrtiol symptoms to 
ony probable causes. The reasoning b done as o series of tests. In this oise, the rule b opplied to determine why the 
image is not sofefodory. 



reasoning approach. Figure 1 gives a sample 
rule in this representation, which determines 
why an image is unsatisfactory. Reasoning 
begins with a set of initial symptoms and ends 
with any probable causes. It is simulated via 
a series of tests that weed out possible causes. 

This representation is a variation of the 
tiaditional rule format: 

• The i f slot contains (in this case) a set 
of syn^)toms. 

• The action and request: slots pro- 
pose an identical test to determine the 
possible cause of the reported* symp- 
tom. A normuU action slot means 
that First AID can conduct the applic- 
able test remotely by caUing the rele- 
vant procedure. 

• The posit ive» negative and 
otlierwise slots contain the conclu- 
sions First AID can draw from a test. 
The otherwise slot provides con- 
clusions when the test has failed. 

• The just if y_ slots contain brief 
explanations for the inferences. 

• The purpose slot contains the reason 
for including the rule. 

• The zone slot classifies rules accord- 
ing to the location of the perceived 
symptom or actual fault. 

The remaining slots are solely for nniainte- 
nance. 

We also developed a small set of metarules 



to control forward chaining. The metarules 
inspect the contents of the bl ackboatd to deter- 
mine the highest priority hypotheses, which 
First AID then uses to select the next nilc to 
fire. 

To implement First AID wc combined 
rapid prototyping with informal user-par- 
ticipative design techniques. We used a 
demonstration prototype to provide the 
proof of concept for our expert system ap- 
proach. Subsequent prototypes incorpo- 
rated many suggestions from engineers to 
improve usability — from minor improve- 
ments, such as rewording explanations and 
user interface labels to make them easier to 
understand* to major issues involving ex- 
pert system security, such as suggesting 
password protection for certain system 
functions. 

Because the knowledge base has only 
185 rules (at the time of publication) and 
because we planned to slowly and incre- 
mentally expand it, we have been able to 
manually verify it and locate any possible 
logical anomalies. We have also subjected 
the inference engine and the user interface 
to rigorous testing to ensure sensible and 
correct behavior. (Once LEO takes over 
knowledge-base expansion and mainte- 
nance, the staff will need an automated 
knowledge-base assistant to support those 
tasks.) Three domain expens have vaUdated 
the knowledge base for accuracy, with the 
leading expert conducting informal valida- 



Figure 2 shows First AID's remote- i 
control system for the LEO 400 series SEMs. 1 
The system has three logical components: the ; 
SEM site, the remote site, and the intervening I 
network. 

At the SEM site (left rectangle in the fig- 
ure), the hypertext documents comprise the * 
interface to the remote SEM operator. We 
have developed software using generic tools : 
such as Netscape, which provides a remote- 
control user interface for SEMs via die Web.^ 
We extended this software using Java applets 
so that it is suitable for remotely monitoring 
instrument parameters as well. The Web 
server provides the hypertext documents to i 
the remote operator*s Web browser. The Web ! 
server transfers requests from the remote 
operator to LEODDE. LEODDE is a Com- 1 
mon Gateway Interface program that trans- ! 
lates requests into dynamic data-exchange | 
requests so that the SEM control software's | 
DDE interface can understand them. Dy- ; 
namic data exchange, which Windows sup- ! 
ports, lets applications executing within a 
ODE environment communicate with each 
other by continuously and automatically 
exchanging data. 

The SEM control software then acts on 
the requests and passes the results back 
through the chain to the Web server, which 
returns them to the remote operator. We 
adapted Cornell University's CUSeeME 
shareware video-conferencing package* to 
provide dynamic image transfer from the 
SEM to the remote operator. 

At die retnote site (far right rectangle in 
the figure), the .service engineer needs a 
graphical Web browser, such as Netscape 
NavigatOT or Internet Explorer, so that he can 
cotmect to the SEM Web server. He also ' 
needs a copy of CUSeeME so that he can • 
receive dynamic images. He can receive sta- 
tic images of up to 1024 x 768 (the SEM's = 
maximum screen resolution) through the ' 
Web browser alone. The Web browser and I 
CUSeeME then handle aU transfer of infor- ^ 
mation and requests over the intervening 
network. ' 

By using generic tools such as Web 
browsers and servers, we could rapidly j 
develop software at tmnimal cost and yet pro- j 
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Rgure 2. How FvH AID fib nto 0 control system for remotely (fiognosing the LEO 400 series scanntng electron mioro- 
scopes. The SEM site IS the location of the microscope. The remote site is the location of the SEM service engineer. Much 
of tfi8 control system uses generic tools like Vfeb browsers arxl servers^ CUSeeME ollows the engine to see dynamic 
images for more effective fault diagnosis. 



vide proven networldiig functionality that is 
suital>Ie for access via botti IcKal area net- 
works and the Internet. 

Integration with Hie Web 

When the remote-control infrastructure 
was stable enough, we began to look at how 
we could make Hnt AID an information client, 
as desdibeU earlier. Spyglass lac. has pro- 
duced a Software Developers Interface {http:// 
www.spyglass.com:4040/t}ewtecnology/ 
integration/iapihiml) tor application program- 
mers to couple the networking facilities of 
standard Web browsers with their software- 
Netscape Communications has implemented 
a slightly modified version of this for the Nav- 
igator browser as a DDE interface {http:// 
searchMetsccq}e.cornMewsrefi^std/ddeapLhtml) 
. Because Wm-Prolog also supports DDE prc^ 
tcxx»ls, we decided to use this mechanism to 
manipulate Netscape and hence the renwte- 
{X)ntrol software. 

We began by exploiting the minimum 
required Netscape functions: 

• start and terminate Netscape, 

• load Web pages, 

• automatically fill in forms and click icons 
on Web p>ages, 

• load files, and 

• retrieve returned data. 

Users can launch any program in En- 
dows using a simple Prolog predicate. They 
can then retrieve returned data for automatic 
processing by saving it as a file for later pars- 
mg. To provide the other functions, we had 
to use some DDE topics (a topic defines a 
DDE conversation). The asynchronous load- 
ing nature of Netscape poses several pitfalls 
and seemingly useful DDE topics are not 
guaranteed to return accurate itiformation 
about the state of a Web-page download. 
Similar difficulties arise when interfacing 
with other tvowsers and AI software. Indeed, 
we were frustrated in recent attempts to 
interface First AID with Micnosoft's Inter- 
net Explorer because Explorer supports only 
a fraction of Spyglasses Software Dcvel- 
oper*s Interface. 

The key DDE topics we have used are 
WWW_Exit, WWW_OpenURL, and WWW_ 
ShowPile. The last two require die system 
to provide the URL of a Web page or the loca- 
tion of a file as well as the identifier of the 
Netscape window to display the download. 



the manner in which the download is to be 
accomplished, and so on. The WWW_ Open 
URL topic can also be used to open Uniform 
Resource Indicators (URIs), which are the 
HTML tags associated with forms and icons. 
Although Win-Prolog suppoas long atoms 
( 1 ,024 characters in tiie current version), DDE 
requests as Microsolt defines them can have at 
naost only 256 characters. This can become a 
problem because a lot of information must be 
encoded in one DDE request to fill in a forai: 
the URl>, the form's URI, the mapping of data 
to form variables, and the necessary control 
uaformation for Netscape. 

The first pitfall with using DDE to con- 
trol Netscape is that communication will suc- 
ceed only if Netscape is actually running. 
Fatal exceptions will occur if the system 
makes a DDE request before Netscape has 
completely started, which includes a fully 
loaded user starting page. Uur solution is to 
provide an error handler to catch the error 
and resend the request 

The second pitfall stems from the differ- 
ence in the time it takes for a Web download 
to complete and the Prolog program to send a 
series of DDE requests. A DDE request to toad 
a specific Web page followed by a second 
request to click on an icon on that Web page 
before it has fully loaded can cause a fatal 
DDE error in Wm-Prolog or a general protec- 
tion fault in Netscape or both. This occurs 
because Netscape is trying to process a request 
it carmot as yet understand. Because depen- 
dent DDE requests can be safely processed 
only after the entire Web page has loaded, we 
force First AID to accept DDE acknowledg- 
ments about the status of the download. We 
do diis by registering Wm-I*rolog as a DDE 
server for the WWW_ crRLEcho topic when 
the WWW_RegisterURLEcho topic is 
invoked. (The WWW_BegiiiProgrees, 



WWW_MakingProgress, andWWW_End- 
Progress topics are not required to provide 
accurate information). Thus Wm-Prolog re- 
ceives a message fipom Netscape every time a 
file is loaded, which includes a message for 
each inline image and complete Web page. 
First AID collects and processes these call- 
backs and transmits dependent DDE requests 
only if the number oi received callbacks 
equals the expected number for the given 
Web page. 

The third pitfall is that incon-ect HTML 
i syntax can produce variable numbers of call- 
• backs, and the user cannot see this discxep- 
: ancy. We have validated the relevant Web 
: documents using HTML syntax checkers. 

How the Web Is used 

Careful use of DDE protocols — 
: although important — cannot produce an ef- 
i fective mformation-client-based expert sys- 
\ tem. First, it is clearly not beneficial to 
suspend all other expen system functions 
: while waiting for a Web page to download. 
Instead, control should remm to the user and 
the inference engine as soon as possible, 
widi the handler processmg DDE callbacks 
in much the same way as it would other low- 
level interruptions. It is also not efficient to 
have to wait for cooperating programs to 
report results from remote tests. The SEM 
operator can usually communicate the test 
results to the service engineer verbally via 
telephone faster than he can transfer them 
over the Internet. 

Figure 3 shows the main consultation 
screen, wliich is the primary screen used chir- 
ing diagnosis. The buttons on the t>ottom half 
reflect choices the remote operator has in 
using First AID: 
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What? lets the engineer deter- 
mine the current inferences and 
facts the system believes. 
How? provides a trace of which 
rules were involved £n deriving 
particular inferences. 
Why? informs tlie engineer of 
the systen)*s purpose in asking a 
question (including which con- 
clusions will be drawn on the 
basis of the engineer's response). 
Justify provides a concise ex- 
planation to support a specific 
conclusion. 

Explain refers the engineer to 
relevant sections of the service 
manual. 

Rule lets the engineer view com- 
plete rules stored in the knowl- 
edge base. 

Purpose provides a brief sum- 
mary of a specific nile*s aim. 
Volunteer lets the service engi- 
neer suggest hypotheses to First 
AID. 

Overrides lets the service engi- 
neer remove casting hypotiieses, 
tell the system to ignore tihem un- 
til further notice, or increase their 
priority. Both this and the volun- 
teer facility are essential in guid- 
ing the consultation because im- 
proved diagnostic infoimation 
aud unpredictable nei%\'ork trans- 
fer mean that the service engineer 
will make inferences sponta- 
neously or later recognize exroDH- 
eCHis ones. 

Gripe is an anonymous feedback 
mechanism that lets the eng^eer 
record criticisms and sugges- 
tions for system improvement. 



.1 



figure 3. first AID's main consuhotion screen. 




First AID in its current state uses 
the Web for four main tasks: remote 
control, remote imagjng^ remote testings and 
online help. The remote control and imaging 
functions are accessible at any time during 
the main part of the consultation; the remote- 
test function is available only svhen a rule that 
permit<t such activity is fired. 

The Control command button lets the 
service engineer access the standard remote- 
control user interface. The Web pages that 
provide the remote-test instmction sequences 
are not for human use because they have been 
optimized for download speed and minimal 
memory requirements. We are still attempt- 



figure 4. What hoppens when the en^neer seleds an iroage-relaled symp- 
lom in the setup window. The text window represents the list of possible 
image compiotnts. 



engineer can manually invoke via 
the Actions button. Most rules in 
the knowledge base include a diag- 
nostic test. About half these rules 
can be conducted remotely. The 
others involve degenerate cases 
such as the inability to start the 
LEO control software because Ihe 
software is corrupt or because an 
instrument must be physically 
adjusted. 

Whenever the inference process 
reaches a rule that does include a 
remote test and the SEM in question 
has the appropriate remote-control 
semp, the user can perform the test 
in either the manual or semiauto- 
matic mode. Manual mode is the de- 
fault when the SEM is not accessible 
through the Internet. It can also be 
used if the vSEM operator does not 
permit Web access or when there are 
network problems. Semiautomatic 
mode explains the command se- 
quences to be invoked and their pro- 
per order. This does not commit the 
engineer to conducting tlie remote 
test nor is he required to invoke all 
sequences. It simply lets him bypass 
commands if information is acquired 
directly from the customer or if he is 
experiencing network failures. He 
can repeatedly attempt any or all por- 
tions of a test before providing First 
AID with a definitive answer about 
the test. 

First AID does not currently sup- 
port an automatic mode, where the 
engineer would have no control over 
the remote-test procedure, but we 
plan to evenmally implement auto- 
matic mechanisms for handling un- 
expected failures at the SEM site or 
within the network. 



ing to acquire and reify the knowledge 
needed to emulate human experts in SEM 
operation. At present, it is more practical to 
use human expertise when full SEM opera- 
tion is required. 

The Image button lets the engineer ac- 
quire static images on demand j5:om the SEM 
and display them in a Netscape window. It 
also lets him launch or terminate CUSeeME 
to view smaller, dynamic images. 

Remote testing. The most important use of 
the Web is for remote testing, which the 



Online help. The second raost important use 
of the Web is to provide online help. Rather 
than incorporate help files directly within the 
expert system or provide help files using the 
traditional hypertext style of Microsoft Help 
fUcs, we decided to produce online help as 
full Hl'ML documents displayed within the 
Web browser. Tfext can thus be supported with 
images and, if necessary, audio or video clips. 

This has several advantages. First, we need 
not burden First AID*s design with the need to 
support vast quantities of factual information 
copied verbatim from ser\'ice manuals and 
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maintained in multiple languages. 
Instead, Brst AID can concentrate on 
its main task, diagnosis. Second, help 
files can contain hypertext links, not 
only to local files on the engineer's PC 
but also to files on secure Web servers 
at LEO headquarters, thus allowing 
access to nooTB infonnation when need- 
ed. Finally, with Web-bosed help facil- 
ities, we can minimize the amount of 
domain-specific help that wUl ac- 
company First AID. This means we 
can maintain First AID and let LEO 
personnel maintain the principal ser- 
vice manuals. Not only is LbO better 
suited for maintaining their manuals, 
but this atrangement also offers a sig- 
nificant security plus; an unprotected 
knowledge base wldi an extended ex- 
planation for each £ault could other- 
wise fall into the hands of competitors. 
Maintaining security and preventing 
excessive overhead from update distri- 
butions are ini^rtant considerations. 



Sample consultation 

To illustrate some of the Web- 
based functionality, we present a 
sample consultation in which a cus- 
tomer telephones a local service 
agency about a problem with his 
SEM. The service engineer starts the 
expert system (which brings up the 
starmp window) and atlen^ to de- 
termine any symptoms. Unfortu- 
nately, neither the customer nor the 
service engineer are fluent in a com- 
mon language, so the service engi- 
neer is able to determine only that the 
problem is some sort of imaging de- 
fiecL From the setup window in Fig- 
ure 4, he selects "image,*" which 
causes the set of image complaints to 
be displayed in the panel at the bot- 
tom of the window. He decides to 
select "poor image quality** as the 
best current description of the prob- 
lem and finalizes this choice by se- 
lecting confirm specific complaints. 

first AID updates its blackboard 
of beliefs and proceeds to the main 
consultation window. Rather than al- 
owing the inference process to begin, the engi- 
neer decides to launch the N^scape browser 
by selecting Connect and requests a static 
image directly from the SEM, while main- 
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figure 5. Stotic SEM Dnoge viewed through Notsoipe. The m^, whdi the 
engineer con view while speduttg to {he custofner (m the phone^ 
groylevek 






OS 


82 

, — ^ 


0 f PS 21 . 



fip^ 6. A dynoink version of the SEM imiige m 
engineer hos reset the instnnneafietting porometm^ 
iflrages shown tfinmgh QISeeME are smoller th(m the stt 
through Netsape, dynonic inoges show the engineer how nrrecfive odions 
ore offectlng the mtcroscope's knoge. In this cose, the engineer can see thot 
resetting the parameters has cou^ the imoge to recover ik gray scab. 



taining a telephone conversation with the cus- 
tomer. Figure 5 shows the image he receives. 

As he inspects the image in Figure 5» the 
engineer notices that it lacks any gray levels. 



which often suggests a failure in the 
image-processing circuit board. He 
next informs First AID of this symp- 
tom using the volunteer facility and 
asserts the belief complaint "image 
lacks gray levels." He signals first 
AID to proceed with inference by 
clicking on proceed. The metarules 
will seek a rule involving a volun- 
teered fact before they seek rules 
involving inferred facts, so Rrst AID 
fires the rule, *\veiidiniager (see fig- 
ure 1). The lule has an associated 
remote test, which the engineer de- 
cides to execute in semiautomatic 
mode. The test involves resetting the 
parameters in the display lookup table 
to their default values. The engineer 
actxvates the reset command and op^is 
a CUSeeME clieitf to view any chai^ 
the reset conunand may cause in the 
image. Figure 6 shows the dynamic 
image he sees at this point After the 
reset comnumd propagates to the target 
SEM to reset the lookup table, the 
image recovers its gray scale. The cus- 
tomer, who is still on the phone, in- 
forms the engineer that the SEM is 
now fully operational. The engineer 
can see the changed image via the 
CUSeeME client To conclude the 
consultation, he selects Disconnect to 
close Netscape and CUSeeME, and 
Ftnisk which prompts First AID to log 
the consultation. The entire consulta- 
tion has probably taken only five min- 
utes and cost a single tel^one call 
Although this sample consuha- 
tion seems simplistic, poor images 
have in reality been misdiagnosed as 
a board failure. The lookup table 
determines the correspondence be- 
tween the discrete signal levels the 
SEM detector captures (and the suc- 
ceeding signal-processing stages) and 
the set of shades or colors seen on the 
screen. As a result, it can radically 
affect perceived brightness, contrast, 
and so on. Expert microscopists use 
the display lookup table to produce 
the highest-quality images so that they 
can, for example, publish them in 
journals. 

Unfortunately, an operator can all 
tcx) easily save these changed kx>kup table set- 
tings inadvertently when shutting down the 
microscope. When he then restarts the micro- 
scope with a different sample, be might be 
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unable to acquire any worthwhile images. Hie 
control software will not indicate that non- 
standard parameters arc being used, so unless 
the operator is familiar with the symptoms, it 
seems to him that the instrmnent has cata- 
strophically failed. . 

la one case of misdiagnosis we know of . 
the manufacturer sent a senior engineer and 
a replacement board from the UK to the cus- 
tomer site. After a few minutes of operating 
the microscope, the engineer discovered the 
real cause, but because of the distances 
involved, he. stayed the rest of the weeit, pro- 
viding additional training in SEM operation. 
The cost of this misdiagnosis was several 
thousand pounds. 



J^VsT AID HAS UNDERGONE HELD 
trials at LEO Giimhridge with the headquarters 
service team; We have since.finalizedthe func- 
tions to be supported and expanded the knowl- 
edge base. We have formally trained domain 
experts, who wiU be responsible for teachihg 
smice aigineers in its use. We have chosen to 
use a pUmrted obsolescence strategy for sched- 
uling maintenance and to ensure that expert 
system security cannot be compromised. Each 
version of first AID will be c^jcrational for nine 
monlhs before it expires permanently. The goal 
is to ensure that only one version ever requires 
technical suppwt at any given time. We expect 
this to simplify and ease the maintenance bur- 
den and, more imi>onant, to ensure that the 
knowledge base is revised and ejqjanded at reg- 
ular intervals. For security, the passwords can 
be changed frequently and any illegal copies 
of First AID will automatically expire. This 
reduces the risk that competitors might gain 
access to the comrncrcially sensitive informa- 
tion within the knowledge base. The updated 
versions will he released to service engineers 
via LEO*s secure bulletin-board system to 
reduce distribution costs. 

The Cambridge University team will also 



assist LEO personnel in determining the extent 
and content of the domain-specific help files 
to be provided, including tlie division between 
information distributed with the expert system 
and information resident on a secure Web 
stjver accessible only to authorized engineers. 

As we described earlier, the remote-con- 
trol software requires a Web server either res- 
ident on the target SEM or on a computer 
linked to the SEM, and the customer's SEM 
must be accessible over tlie Welx LEO has also 
released software (NetSEM) that lets its cus- 
tomers remotely control their own SEMs for 
instniment sharing, collaboration, and other 
applications. We have since updated First AID 
to use NetSEM for remote diagnosis. 

Clearly, issues of security must be ad- 
dressed before such access will be permitted ; 




WEHOFETlMrmETMGIBLE 
BENEFITS OF IMPROVED 
TECHNICAL SUPPORT COMBINED 
WrmSECUBESEmCEWRL 
OVERCOME ANY ANXMIES 
Wmm THE CUSTOMER BASK 

both Cambridge University and LEO are con- 
sidering these. However, the microscopy 
community remains very interested in using 
the Internet for remote microscopy. We hope 
that the tangible benefits of improved techni- 
cal support combined with secure service will 
ovMcome any anxieties within the customer 
base, and are currently awaiting a decision by 
LEO to commit to global deployment of Firist 
AID. 0 
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